SN 10/053,884 



Remarks 

This supplemental preliminary amendment is filed for the purpose of making 
correction to the prior art cited on page 2 of the specification. Consideration and allowance 
of the claims is earnestly solicited. No new matter is submitted since the web site added 
simply presents background information with respect to the previously recited W4F. 

Attached hereto is a marked-up version of the changes made to the specification and 
claims by the current amendment. The attached page is captioned " Version with markings to 
show changes made ." 
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VERSION WITH MARKINGS TO SHOW CHANGES MADE 
In the Specification: 

Paragraph beginning at line 1 of page 2 has been amended as follows: 
There are solutions that standardize various file formats to one and the same, processable 
form. For instance XWrap Elite (an extensible Wrapper General System Elite Version) is a 
software application that tries to identify repeated data structures in HTML documents and to 
create specific modification rules, whereby their contents can be translated to the XML 
5 language. However, the operation of the application is bound to the HTML structure. 
Before the priority date of the present application, the XWrap Elite-program has been 
described in more detail at the address http://www.cc.gatech.edu/projects/disl/XWRAPElite/. 
Another corresponding solution based on the HTML structure is the W4F - World Wide Web 
Wrapper Factory, which before the priority date of the present application is described in 

10 more detail at the address http://www.tropea-inc.com/technology/W4F/. After the priority 
date of the present application corresponding information has been made available at another 
network address, which is http://citeseer.nj.nec.com/did/95215 and also at 
http : //cheops . cis . upenn . edu/ W4F . In addition, the Xerox Research Center Europe has a 
system that was made public in the spring of 2000 for generating data translation scripts, 

15 called Wrapper Generation via Grammar Induction; also this system is bound to the HTML 
format. Already before the priority date of the present application, said system was 
published at the addresses http://turing.wins.uva.nl/-ragetli/documents/chidlovskii.ps and 
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http://turing.wins.uva.nl/ ~ ragetli/ecml2000/ecml00a/. All of these existing solutions are 
based on the HTML format, wherefore their area of usage is fairly limited. There also is a 
patent related to this subject, i.e. US 6,151,608 that describes how data is modified to a 
database table without writing code. 



